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Abstract 

Sand flies include a group of insects that are of medical importance and that vary in geographic distribution, ecology, and 
pathogen transmission. Approximately 163 species of sand flies have been reported in Colombia. Surveillance of the 
presence of sand fly species and the actualization of species distribution are important for predicting risks for and 
monitoring the expansion of diseases which sand flies can transmit. Currently, the identification of phlebotomine sand flies 
is based on morphological characters. However, morphological identification requires considerable skills and taxonomic 
expertise. In addition, significant morphological similarity between some species, especially among females, may cause 
difficulties during the identification process. DNA-based approaches have become increasingly useful and promising tools 
for estimating sand fly diversity and for ensuring the rapid and accurate identification of species. A partial sequence of the 
mitochondrial cytochrome oxidase gene subunit I (COI) is currently being used to differentiate species in different animal 
taxa, including insects, and it is referred as a barcoding sequence. The present study explored the utility of the DNA barcode 
approach for the identification of phlebotomine sand flies in Colombia. We sequenced 700 bp of the COI gene from 36 
species collected from different geographic localities. The COI barcode sequence divergence within a single species was 
<2% in most cases, whereas this divergence ranged from 9% to 26.6% among different species. These results indicated that 
the barcoding gene correctly discriminated among the previously morphologically identified species with an efficacy of 
nearly 100%. Analyses of the generated sequences indicated that the observed species groupings were consistent with the 
morphological identifications. In conclusion, the barcoding gene was useful for species discrimination in sand flies from 
Colombia. 
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Introduction 

The subfamily Phlebotominae Rondani, 1 840, is represented in 
America by more than 500 species distributed among the genera 
Lutzomjia Franca, 1924, Brumptomyia Franca and Parrot, 1921, and 
Warileja Hertig, 1948 [1-3]. The Lutzomyia (L.) species are known 
for their role as vectors of medically important pathogens, such as 
the kinetoplastida protozoan of the genus Leishmania Ross, 1903 
[4,5], the bacterium Bartonella bacilliformis (Strong et al. 1907), and 
viruses of the families Bunyaviridae, Reoviridae, and Rhabdoviridae [6— 
8]. 

Although phlebotomine sand flies have been studied more than 
most other groups of insects because of their role as vectors of 
pathogens, the taxonomic knowledge of these insects is far from 
complete. Three major reviews [1,2,9] and the electronic 
documentation produced by the Walter Reed Biosystematics Unit 
(WRBU, http:/ /wrbu. si.edu/) are essential tools for studying the 
systematics, taxonomy, and geographic distribution of this group 
of medically important insects in the Americas. 

To date, in Colombia, 163 species of sand flies (153 Lutzomyia, 8 
Brumptomyia, and 2 Warileya species) have been recorded. This list 
was obtained from taxonomic studies conducted in Colombia in 



different natural and endemic environments, with a primary focus 
on areas of leishmaniasis transmission [10—19]. 

There are approximately 13 species in the genus Lutzomyia that 
have been incriminated as vectors of Leishmania spp. [20] in 
Colombia. Since various vector species often coexist in areas of 
disease transmission, specific identification of the species is useful 
for monitoring and subsequent intervention by officials from the 
Vector-Borne Diseases (ETV) units [21]. 

Currently, the identification of species within the subfamily 
Phlebotominae is based on morphological recognition and 
morphometric analysis of a large number of structures, primarily 
in the head and genitalia [1,2]. However, species identification is 
complicated because it requires a considerable degree of skill and 
taxonomic expertise. In addition, morphological identification 
may be limited by deterioration of samples or by improper 
mounting techniques. 

Molecular methodology is now being used to explore taxonomic 
questions at different hierarchical levels [22,23]. Ribosomal gene 
sequences (18SrDNA and the D2 domain in 28SrDNA) were used 
to reassess the relationships within the family Phlebotominae [24] 
and within the genus Lutzomyia [25-28]. Gene sequences with a 
faster rate of evolution, mitochondrial genes (e.g., cytochrome b 
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Figure 1 . Map of the study area showing the sampling sites of sand flies collected in the present study. Site 1 : Jardin; Site 2: Jerico; Site 
3: San Francisco; Site 4; San Roque; Site 5: Santa Fe de Antioquia; Site 6: Chinchina; Site 7: Salamina; Site 8: Samana; Site 9: Victoria, Site 10: Acandi; Site 
11: Bahi'a Solano, Playa Potes; Site 12: San Andres de Sotavento; Site 13: Ricaute; Site 14: Quimbaya; Site 15: Gramalote; Site 16: Coloso, Serrania de 
Coraza Forestry Reserve; Site 17: Chaparral; Site 18: Alcala; and Site 19: Caho Ariba. 
doi:1 0.1 371 /journal.pone.0085496.g001 



gene and NADH Dehydrogenase 4 (ND4) gene), and nuclear 
genes (e.g., the complete internal transcribed spacer 2 (ITS2) and 
nuclear Elongation Factor 1 -alpha (EF-loc) gene), have been used 
to resolve intraspecific and subgeneric relationships [29-37]. 

Following the proposed use of DNA sequences as a support tool 
for studies of biological diversity and classification [38,39], Hebert 
et al. [40,41] suggested the use and analysis of the 5' fragment of 
the mitochondrial gene Cytochrome C Oxidase Subunit I (COI), 
called a DNA barcode. This region is of potential importance for 
facilitating inventories of biodiversity and performing species' 
identifications. The DNA barcode initiative has been well received 
due to the connectivity and common language of DNA sequences, 
allowing researchers worldwide to advance in taxonomy and 
systematic studies of various groups of organisms, including insect 
disease vectors [42]. The 5' fragment of COI appears to be an 
excellent tool for studying insects of medical importance, as such 
study requires the quick and accurate identification of species that 
are present in a transmission area [28,42,43]. 

Regarding the DNA barcode, complications associated with use 
of this region include nuclear sequences of mitochondrial origin 
(NUMTs) and endosymbionts. Also, standardization of the 
methodology is necessary for each group studied. Furthermore, 
the use of a single marker for the taxonomy of a wide range of taxa 
is controversial [44-48]. 

Despite many demonstrations of the effectiveness of DNA 
barcoding in other taxa, few studies on phlebotomine sand flies in 



Colombia have attempted to use this methodology as a valid 
molecular tool for identifying species of phlebotomine sand flies 
[28,49] and for revealing cryptic species [50-53]. 

In the present study, haplotypes of the barcode sequence were 
assigned to 148 individuals belonging to 36 phlebotomine sand fly 
species collected in Colombia, some of which are involved in the 
transmission of leishmaniasis. Specimens were obtained from sites 
where outbreaks of the disease have occurred. The present study 
evaluated the usefulness of the barcode sequence in species 
differentiation compared with previous morphology-based identi- 
fications. 

Materials and Methods 

Ethics Statement 

Collection of sand flies was done according to the parameters of 
Colombian decree number 1376 http://190.147.21 3.68:8080/ 
HOMEPAGE/ALEGIS_INTER/LEYES_Y_DECRETOS/ 
20 13/DECRETO_l 376_de_2013.pdf which regulates the permit 
for collection of specimens of wild species of biological diversity for 
non-commercial research. No specific permits were required for 
this study. The sand flies were collected on private property, and 
permission was received from land owners prior to sampling. 
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Table 1. List of sand fly species and the numbers of individuals with mitochondrial haplotypes. 





Species 


Number of specimens 


Number of haplotypes 


Number of nucleotidic 
divergences 


B. beaupertuyi 


1 


1 


0 


B. hamata 


2 


2 


1 


B. guimareasi 


3 


2 


2 


B. mesai 


2 


1 


0 


L. antunesi 


2 


2 


3 


L. barretoi majuscula 


3 


2 


3 


L. bifoliata 


2 


1 


0 


L. carpenter! 


2 


1 


0 


L carrerai thula 


5 


4 


16 


L cayennensis cayennensis 


1 


1 


0 


L. Columbiana 


5 


5 


7 


L. coutinhoi 


3 


3 


6 


L. evansi 


3 


3 


9 


L. gomezi 


9 


9 


40 


L. hartmanni 


4 


4 


5 


L. lichyi 


1 


1 


0 


L longiflocosa 


4 


2 


4 


L. longipalpis 


4 


4 


8 


L. (Lutzomyia) sp. 


2 


1 


0 


L. migonei 


4 


2 


1 


L. nuneztovari 


2 


1 


0 


L. panamensis 


7 


6 


38 


L pia 


16 


2 


14 


L reburra 


3 


2 


1 


L scorzai 


9 


7 


28 


L. (Helcocyrtomyia) sp.1 


2 


1 


0 


L. shannon! 


1 


1 


0 


L. spinicrassa 


4 


3 


6 


L. sordelli 


2 


1 


0 


L. trapidoi 


9 


9 


32 


L. trinidadensis 


10 


5 


27 


L. triramula 


7 


6 


22 


L walked 


2 


1 


0 


L. yuilli yuilli 


8 


4 


5 


L. youngi 


1 


1 


0 


W. rotundipennis 


3 


2 


1 


Total 


148 


103 





doi:1 0.1 371 /journal.pone.0085496.t001 



Collection of specimens 

Phlebotomine sand flies were collected using CDC light traps 
and Shannon traps from different locations in Colombia between 
2008 and 2012. Areas selected for the collections were based on 
previous epidemiological studies of leishmaniasis transmission- 
made by the Program for the Study and Control of Tropical 
Diseases (PECET) (Figure 1 and table SI). The thorax and legs 
were removed from each specimen for DNA extraction and stored 
at — 20°C, while the head and abdomen were processed for 
traditional morphological identification [1,2,54]. After being 
assigned an identification code, the remains of each specimen 
were preserved as a collection of voucher specimens. 



DNA extraction and polymerase chain reaction (PCR) 

Total DNA (50 U.L) was extracted from each sample using the 
Qiagen® DNeasy Blood & Tissue kit (Qiagen, Chatsworth, CA). 
To amplify the mitochondrial 5' COI gene region, PCR was 
performed using the following thermal profile: an initial denatur- 
ation of 5 min (94°C); followed by 35 cycles at 94°C for 1 min 
(denaturation), 45°C for 1 min (annealing), and 72°C for 1 min 
(extension); and a final extension at 72°C for 10 min. Each PCR 
cocktail had a final reaction volume of 50 ju.1 and contained the 
following: 0.3 uL of Taq DNA polymerase (5 U/uL) (Biolase®), 
5 uL of 10 x PCR buffer (NH 4 S0 4 ), 5 uL of 2.5 mM MgC12, 
5 \lL of 2 mmol/L dNTP, 2 pi of 1 0 |0.mol/L of each oligonucle- 
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Table 2. K2P pairwise 
species. 


distances 


among Colombian sand fly 




Species 


No. of Specimens K2P values 


B. hamata 


2 


0 001 


B. guimareasi 




n nm n nn^ 

U.UU I — U.UU.5 


B. mesai 


2 


0 000 


L ontunesi 


2 


0 004 


L barretoi majuscula 




0 000 


L bifoiiata 


2 


0 000 


L. carpenteri 


2 


0 000 


L carrerai thula 


c 
D 


n nm n m s 

U.UU I — U.U I o 


L columbiana 


c 
J 


n nm n nn7 

U.UU I —U.UU/ 


L coutinhoi 


■3 
-> 


u.uu i — u.uuy 


L evansi 


■D 


U.UU'f— U.U I z 


L gomezi 


Q 


n nm n n^n 

U.UU I — U.UOU 


L hartmanni 


4 


n nn^ n nns 
U.UU J— u.uuo 


L longiflocosa 




n nnn n nn£ 
u.uuu— U.UUD 


L longipalpis 


4 


n nm nnn 

U.UU I — U.U I z 


L (Lutzomyia) sp. 


2 


0 000 


L migonei 


4 


n nm n nm 

U.UU I —U.UU I 


L. nuneztovari 


2 


0 000 


L panamensis 


7 


n nnn n nAfi 

U.UUU— U.U'fO 


L pia 


1 a 


n nnn n mi 
u.uuu— U.UZ 1 


L reburra 


3 


n nnn n nm 

U.UUU U.UU I 


L scorzai 


9 


0.000-0.026 


L (Heicocyrtomyia) sp.1 


2 


0.000 


L spinicrassa 


4 


0.003-0.007 


L sordeili 


2 


0.001 


L trapidoi 


9 


0.000-0.030 


L trinidadensis 


10 


0.000-0.034 


L triramuia 


7 


0.004-0.024 


L walkeri 


2 


0.000 


L yuilli yuilli 


8 


0.001-0.007 


W. rotundipennis 


3 


0.000-0.001 


doi:1 0.1 371 /journal.pone.0085496.t002 



otide (forward and reverse), 4 |iL of the DNA template, and 
ddH20 to bring the volume to 50 |xl. The oligonucleotides used to 
amplify the 700 bp COI fragment were LCO1490 and HC02198 
[41,55]. 

The amplified fragments were run on an agarose gel at 1 % in 
1 x TBE buffer (40 mM Tris-Borate, 1 mM EDTA, pH 8.0) to 
verify the integrity of the fragments, after which the PCR products 
were purified using a MultiScreeiiHTS Vacuum Manifold (Milli- 
pore). Subsequently, the PCR products of the amplified fragments 
were sent for sequencing in both chain directions to Macrogen Inc. 
(Korea) or to the CDC (Atlanta, GA, USA). 

Data analysis 

The chromatograms obtained were edited using BioEdit 
(http://www.mbio.ncsu.edu/bioedit/bioedit.html) [56] to gener- 
ate a consensus sequence for each specimen. The DNA sequences 
were aligned using the Clustal W tool incorporated into MEGA 
v5.0 software [57]. Each sequence was compared with other insect 
mitochondrial genomes using BLAST (http://blast.iicbi.nlm.nih. 



gov/Blast.cgi) [58] to confirm the identity of the obtained 
fragments (the 5' COI region). The number of haplotypes (h), 
polymorphic sites (s), and nucleotide diversity were determined 
using DnaSP v5.10 (http://www.ub.edu/dnasp) [59]. 

The nucleotide compositions and sequence divergences within 
and between species were calculated using the distance model 
Kimura two-parameter (K2P) [60]. A dendrogram of K2P 
distance Neighbor-Joining (NJ) was generated to provide a 
graphical representation of the clustering pattern across different 
species [61]. 

All of the sequences obtained in the present study have been 
submitted to GenBank (http://www.ncbi.nlm.nih.gov/genbank/). 
The collection locations and other information regarding the 
specimens, such as the GenBank accession numbers, will be 
available on the BOLD Systems (http://www.boldsystems.org) 
platform, as the "Phlebotominae Barcoding Initiative of the 
Americas - PBIN" project. 

Results 

Specimens collected 

A total of 36 phlebotomine species belonging to 3 genera were 
collected from 1 1 Colombian departments (Table SI). Some of the 
collected species had epidemiological histories (Table 1): Lutzomyia 
columbiana (Ristorcelli and Van Ty, 1941), Lutzomyia evansi (Nunez- 
Tovar, 1924), Lutzomyia gomezi (Nitzulescu, 1931), Lutzomyia 
hartmanni (Fairchild & Hertig, 1957 ), Lutzomyia longiflocosa Osorno, 
Morales Osorno and Mufioz, 1970, Lutzomyia longipalpis (Lutz & 
Neiva, 1912), Lutzomyia nuneztovari (Ortiz 1954), Lutzomyia pana- 
mensis (Shannon, 1926), Lutzomyia pia (Fairchild & Hertig, 1961), 
Lutzomyia shannoni (Dyar, 1929), Lutzomyia trapidoi (Fairchild & 
Hertig, 1952), Lutzomyia youngi Feliciangeli & Murillo, 1987, and 
Lutzomyia yuilli yuilli Young & Porter, 1972. 

Sequence analysis 

The 5' fragment of the COI mitochondrial gene was amplified 
and sequenced for each sample (n= 148), and the total size was 
700 bp. The identities of the obtained sequences corresponded 
with positions 16 to 716 of the COI mitochondrial gene on Aedes 
aegypti (NC_0 1024 1.1), which was used as the reference genome. 
This segment also corresponded with that proposed by Hebert et 
al. [41] as a barcode region for species identification. No 
insertions, deletions, or stop codons were observed indicating the 
absence of pseudogenes or nuclear copies of mitochondrial origin 
(NUMTs).The sequences from the 36 species of phlebotomine 
sand flies that had previously been identified based on morphology 
were represented by 1 to 16 individuals (Table SI). The 
identification numbers and GenBank accession numbers for the 
specimens are summarized in table SI. 

During alignment at the nucleotide level, a total of 417 
conserved sites and 283 variable sites were observed. The obtained 
COI sequences contained a large number of A+T pairs (an 
average of 66.3% for all codons), particularly in the third codon 
position (87.6%) (Table S2). 

Of the 148 sequences obtained, 103 haplotypes were charac- 
terized for phlebotomine sand flies, indicating a high degree of 
diversity. The number of haplotypes within a species ranged from 
1 to 9 (Table 1), and variable haplotypes were obtained in most 
species, except for Brumptomyia mesai, Brumptomyia beaupertuyi, 
Lutzomyia bifoiiata, Lutzomyia carpenteri, Lutzomyia nuneztovari, Lutzo- 
myia sordellii, Lutzomyia walkeri, and Lutzomyia [Heicocyrtomyia) sp. 1, 
which exhibited unique haplotypes for different specimens. The 
largest numbers of haplotypes were observed in Lutzomyia gomezi (9 
haplotypes), Lutzomyia scorzai (7), Lutzomyia trapidoi (9), Lutzomyia 
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Figure 2. Genetic divergences (K2P distances) between COI sequences for varied taxonomic levels of Colombian sand fly species. 

The frequency of pairwise divergences among specimens both within species and among different species. 
doi:1 0.1 371 /journal.pone.0085496.g002 



panamensis (6), Lutzomyia trinidadensis (6), Lutzomyia triramula (6), 
Lutzomyia columbiana (5), and Lutzomyia carrerai thula (5) (Table 1). 

Regarding the numbers and types of substitutions observed 
among different haplotypes of the same species, the results showed 
between 2 to 40 variable positions (Table 1), and nucleotide 
changes were primarily synonymous substitutions, such as 
transitions (purine for purine and pyrimidine for pyrimidine), at 
the third positions. 

Pattern of sequence divergence 

The set of sequences obtained was used to evaluate efficacy of 
the barcoding method to delineate species. Genetic distances, as 
determined by the K2P intraspecies values, ranged between 0 and 
6% (average intraspecific divergence = 1.6%, standard devia- 
tion =1.38%), and maximum intraspecific divergence was ob- 
served in 8 of the 36 species (Table 2). Between species, the genetic 
distances showed K2P values between 9 and 26.7% (average 
divergence = 19%, standard deviation = 2.57%). 

Given the presence of the "barcode gap" as a delimiting 
criterion and an indicator of species differentiation [62,63], it was 
determined that the distributions of all intraspecific paired 
distances and the interspecific distributions of distances were not 
superimposed. These differences indicated genetic variations 
between species (Figure 2). 

Neighbor-joining dendrogram (NJ) 

NJ dendrogram (Figure 3), shows the limits representing 
different taxonomic groups and differentiation of species. Further- 
more, specimens of the same species were always observed to be 
closely grouped, regardless of the collection site; for the Molecular 
Operational Taxonomic Units (MOTU), 36 taxonomic groups or 
units were identified, compared with the 36 species that were 
recognized by traditional taxonomy based on morphological 
characters. The nodes connecting the sequences of individuals 
from the same species were supported by high bootstrap values 
(99%-100%). 



Final identification 

Using the divergence of K2P distances, NJ dendrogram 
bootstrapping, and traditional morphological approaches, we 
found COI sequence always corresponded with species' identifi- 
cations derived from morphologic characters (Figure 3). 

Discussion 

DNA barcode is a system designed to provide rapid, accurate, 
and automatable species-identification based on the standard 
analysis of short regions of genes to produce species tag [64].. The 
present study evaluated the efficacy of using COI barcodes for the 
identification of phlebotomine sand fly species in Colombia and 
provided a group of sequences associated with the identified 
species. For most species examined in the present study, a coherent 
matrix of DNA barcode sequences was obtained, which differen- 
tiated each species. 

The 5' region of the mitochondrial COI gene is flanked by 
highly conserved regions, as noted by Hebert & Gregory [64] . The 
absence of stop codons was consistent for all of the amplified 
sequences, suggesting the functionality of the COI mitochondrial 
sequences. These sequences were 700 bp in length, suggesting that 
NUMTs were not sequenced [65] . 

The nucleotide lengths of amplified sequences of the COI gene 
(700 bp) in the present study were longer than those reported in 
previous studies performed in Phlebotominae: 549 bp reported by 
Azpurua et al. [28], 681 bp by Cohnstaedt et al. [50], 689 bp by 
Kumar et al. [49], and 548 bp reported by Hoyos et al. [51]. 

Within the DNA barcode sequences in phlebotomine sand flies, 
high A+T contents were observed, which were within average 
values (68.1 to 70.7%) reported for species of the order Diptera, 
including suborders Nematocera and Brachycera [66,67] . Similar 
values of A+T contents (66.7%) were observed in Phlebotomus and 
Sergentomyia in studies examining COI [28,49-51]. There were no 
significant variations in the percentage of G+C content among the 
sequences of Phlebotominae from the different localities studied, 
and no diagnostic correlation for the percentage of G+C content 
was observed. The G+C content observed for the phlebotomine 
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— Lutzomyia gomezi KC921253 

— Lutzomyia gomezi KC921254 

— Lutzomyia gomezi KC921247 

— Lutzomyia gomezi JN845545 

— Lutzomyia gomezi KC9S1 25 1 

— Lutzomyia gomezi KC921249 

— Lutzomyia gomezi KC921250 

— Lutzomyia gomezi KC921251 

— Lutzomyia gomezi KC921248 

— Lutzomyia trapidoi KC921299 

— Lutzomyia trapidoi KC921304 

— Lutzomyia trapidoi KC921305 

— Lutzomyia trapidoi KC921300 

— Lutzomyia trapidoi JN845552 

— Lutzomyia trapidoi KC921301 

— Lutzomyia trapidoi KC921302 

— Lutzomyia trapidoi M845551 

— Lutzomyia trapidoi KC921303 

— Lutzomyia reburra KC921283 

— Lutzomyia reburia KC921284 

— Lutzomyia reburra KC921282 

— Lutzomyia shannon! KC921296 

— Lutzomyia antunesi JN845548 

— Lutzomyia antunesi JN 845549 

— Lutzomyia yuilti KC921323 

— Lutzomyia yuilli KC921319 

— Lutzomyia yuilti KC921324 

— Lutzomyia yuilti KC921321 

— Lutzomyia yuilli KC921325 

— Lutzomyia yuilli KC921322 

— Lutzomyia yuilti KC921320 

- Lutzomyia yuilli KC921326 

- Lutzomyia caneraithula KC921239 

- Lutzomyia caneraithula KC92I241 

— Lutzomyia caneraithula KC921238 

— Lutzomyia caneraithula KC921237 

— Lutzomyia caneraithula KC921240 

— Lutzomyia panamensis KC921262 

— Lutzomyia panamensis KC921264 

— Lutzomyia panamensis KC921 266 

— Lutzomyia panamensis KC921265 

— Lutzomyia panamensis KC921263 

— Lutzomyia panamensis JN845553 




- Lutzomyia (Lutzomyia) Sp JN845544 

- Lutzomyia longipalpis JN845540 

- Lutzomyia longipalpis JN845542 

- Lutzomyialongipalpis JN845539 

- Lutzomyia longipalpis M84S541 

- Lutzomyia migonei LumigcosucoiOl 

- Lutzomyia migonei Lumigcosucoi04 

- Lutzomyia migonei Lumigcosucoi02 

- Lutzomyia migonei Lumigcosucot03 

- Lutzomyia Oatretori majuscuta KC92122? 

- Lutzomyia barreloi majuscuta KC921228 

- Lutzomyia barretoi majuscuta KC921 226 

- Lutzomyia carpenteh KC921235 

- Lutzomyia carpenteh KC921236 

- Lutzomyia coulinhoi KC921230 

- Lutzomyia coulinhoi KC921229 

- Lutzomyia coulinhoi KC921231 

- Lutzomyia sordelli KC921298 

- Lutzomyia sordelli KC921297 

- Lutzomyia columbiana KC921242 

- Lutzomyia columbiana KC921243 

- Lutzomyia columbiana KC921246 

- Lutzomyia columbiana KC921244 

- Lutzomyia columbiana KC921245 

- Lutzomyia pia ¥.0921279 

- Lutzomyia pia KC921273 

- Lutzomyia pia KC921272 

- Lutzomyia pia KC92J276 

- Lutzomyia pia KC921281 

- Lutzomyia pia KC921267 

- Lutzomyia pia KC921270 

- Lutzomyia pia KC92127S 

- Lutzomyia pia KC921268 

- Lutzomyiapia KC921278 

- Lutzomyiapia KC921277 

- Lutzomyiapia KC921269 

- Lutzomyia pia KC92I280 

- Lutzomyiapia KC921271 
-Lutzomyia pia KC921 274 

- Brumplomyia guimareasi KC921223 

- Brumplomyia guimareasi KC921225 

- Brumplomyia guimareasi KC921224 

- Brumplomyia hamata BrhamsfanKoOl 

- Brumplomyia hamata Brhamsfanlco02 

- Brumplomyia beapertuyi KC921222 

- Brumplomyia mesa/ BrmesaochcolOI 

- Brumplomyia mesat Brmesacchcol02 

- Lutzomyia bifoliata KC921233 

- Lutzomyia biioliata KC921234 

- Lutzomyia bitoliata KC921232 

- Lutzomyia lichyi KC921259 

- Lutzomyia hartmanni KC921255 
niKC921256 



- Lutzomyia hartmanni KC921257 

- Lutzomyia hartmanni KC921258 

- Lutzomyia (Helcocyrtomyia) sp.1 KC921294 

- LutzomyialHelCocyrtomyia) sp. 1 KC921295 

- Lutzomyia scorzai KC921289 

- Lutzomyia scorzai KC921290 

- Lutzomyi scorzai KC921293 

- Lutzomyia scorzai KC921285 

- Lutzomyia scorzai KC921291 

- Lutzomyia scorzai KC921286 

- Lutzomyia scorzai KC921292 

- Lutzomyia scorzai KC921287 

- Lutzomyia scorzai KC921288 

- Lutzomyia velkery KC921318 

- Lutzomyia velkery KC921317 

- Lutzomyia cayennensis cayennensis JN845561 

- Lutzomyia nuneztovah KC921260 

- Lutzomyia nuneztovah KC921261 

- Lutzomyia spinicrassa JN845528 

- Lutzomyia spinicrassa JN845529 
'a spinicrassa JW845530 



- Lutzomyia spinicrassa JN845531 

- Lutzomyia youngi KC921327 

- Lutzomyia longiSocosa JN845533 

- Lutzomyia longiflocosa JN845532 

- Lutzomyia longiBocosa JN845534 

- Lutzomyia longiSocosa JN845535 

- Lutzomyia evansi JN84S536 
iaevansiJN845537 



- Lutzomyia evans, JN845538 

- Lutzomyia triramula KC921310 

- Lutzomyia triramula KC921314 

- Lutzomyia triramula KC921312 

- Lutzomyia triramula KC921315 

- Lutzomyia triramula KC921316 

- Lutzomyia triramula KC921311 



— Lutzomyia 

— Lutzomyia 

— Lutzomyia 

— Lutzomyia tri 

— Lutzomyia tri 

— Lutzomyia tri 

— Lutzomyia tri 

— Lutzomyia tri 



'a KC921313 
is JN845555 
is KC921306 



'S JN8< 



:is JN845556 
:is JN845559 
iis JN845560 

- Warileya rotundipenms KC921328 

- Warileya rotundipennis KC921329 

- Warileya rotundipennis KC921330 



Figure 3. Neighbor-joining analysis of mitochondrial COI 
sequences of Colombian sand fly species. 

doi:10.1371/journal.pone.0085496.g003 

sand flies is similar to that of other insects and lower categories on 
the zoological scale [68], 

In terms of codon usage, a preference for A or T termination 
within the triplets was clearly observed. This finding is supported 
by the fact that the base composition of mitochondrial DNA is 
highly correlated with codon usage because genes encoding 
mitochondrial proteins exhibit a preference for the use of codons 
rich in A+T. This phenomenon appears to be a characteristic of 
the Insecta mitochondrial genome [69,70]. These values are 
similar to those observed other Diptera [66,70-75] and, in 
particular, for species in the genus Lutzomyia, for which different 
mitochondrial gene fragments have been analyzed, including 
Cytochrome b (A+T = 73.09%), tRNA serim; (A+T = 81.2%), and 
ND4 (A+T=72.5%) [31,35,37]. 

Regarding differences in the numbers and types of substitutions 
among different haplotypes from the same species, the results 
revealed 2 to 40 variable positions changes were largely 
synonymous substitutions of the transitional type (purine by 
purine and pyrimidine by pyrimidine) at third positions, which do 
not affect the amino acid composition of the protein. In contrast, 
between species, nucleotide substitution rates gradually increased 
in relation to genetic distances. Blouin et al. [76] suggested that 
nucleotide substitution patterns and rate of divergence between 
sites provided diagnostic information, thus, indicating the value of 
DNA sequences as molecular tools for differentiating species. 

Aliabadian et al. [77] indicated that the success of barcode 
identification based on genetic distances ultimately depended on 
differences between intra- and interspecific divergences; ideally, 
there is no barcode overlap between the distributions of these two 
classes of distance. According to Hebert et al. [62], the barcoding 
gap (i.e., the difference between intra- and interspecific distances) 
will allow assignment of categories or species taxonomic status to 
specimens once barcode reference libraries are complete and 
available. The present study confirms that, among the taxa 
examined, genetic distances and barcoding gaps within a species 
are generally much smaller than those between individuals of 
different species. 

The use of DNA barcoding can delineate the boundaries 
between species and assign taxonomic status to unknown 
individuals from known species. The analysis of genetic distances 
is one of the criteria for acquiring this information [62]. The range 
of intraspecific variation was between 0 and 6%, while the range of 
interspecific variability was between 9 and 26.7%. 

Hebert et al. [41] proposed an interspecific limit of 3% for 
insects. The high intraspecific variability (>2%) observed between 
different haplotypes of L. gomezi (6% maximum intraspecific 
variability), L, panamensis (4.6%), and L. trinidadensis (3.4%) (Table 2) 
could suggest a population differentiation or the presence of 
cryptic species [51,62]. This is a reflection of the large number of 
mutations observed in relation to the other sequences. Also,the 
wide intraspecific divergence could be related to rapid variation of 
the mitochondrial genome [78]; range or distribution of a species, 
which allows for isolation and genetic differentiation; and presence 
of ancestral polymorphisms, which may be conserved over time 
due to the aggregate population distribution characteristic of 
phlebotomine sand flies [79]. 

The low intraspecific variability observed may be the effect of 
sampling [80,81]. Sample size per species ranged from 1 to 16 
specimens, but only two species had more than nine individuals 
(Table 1). The 6% nucleotide divergence (well above the 3% level 
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often considered an upper limit for intraspecific variation) 
observed for L. gomezi and this species' extensive geographic 
distribution (from Central America to South Region of Brazil) 
suggests the present morphospecies may be comprised of two or 
more cryptic species. 

In the present study, we found a nucleotide divergence range 
between 9% and 26.7% for differentiation of the 36 species of 
phlebotomine sand flies identified. 

In general, our results revealed that there was no overlap 
between the levels of intra- and interspecific divergence, the 
previously mentioned barcode gap, even though higher values of 
intraspecific divergence were observed. This separation confirms 
the validity and partial effectiveness of the DNA barcode sequence 
as a molecular identifier of species [40] . 

The geographical scale of sampling has a critical impact on the 
global application of DNA barcoding [81]. The dependency of 
intraspecific genetic variation on geographical scale of sampling is 
to be expected based on widely recognized theory and concepts 
such as distance decay and isolation by distance, as well as from 
phylogeographic studies. As a general rule, a species sampled 
throughout its geographical range will reveal greater genetic 
variation than if the variation was estimated from a single smaller 
region [44,45,63,78,80-85]. The phlebotomine sand flies species 
in our study were not sampled across its entire distribution range 
in Colombia and it will be desirable. However, when our data set 
was compared with published sequences in genbank, the range of 
intra and interspecific divergence values (0-6% and 9-26.7%) did 
not change. This evidence suggests accurate identification of our 
phlebotomine sand flies. Consequently, data derived from the COI 
sequence fragment were found to be helpful for species identifi- 
cation, and may be applied in the future for identifying species 
from different regions. 

The NJ method is conceptually related to grouping, but does 
not involve a phylogenetic tool, the natural history, or the behavior 
of a molecular clock [86]. Combining a NJ dendrogram with a 
bootstrap analysis is the most appropriate method for evaluating 
trees using distance methods [87]. The COI NJ dendrogram 
confirmed that the species of a genus generally form cohesive 
groups. 

In the present study, all MOTUs, including individuals 
belonging to the same species, were supported by the high 
bootstrap value of 100%, thus validating the consistency and 
usefulness of the COI barcode sequence to assign individuals to 
groups that corresponded to the species previously defined by 
specialists based on morphological features. Haplotypes from 
single species formed barcode groups that were clearly distin- 
guishable from related species. As expected, some species that 
were represented by single individuals were not grouped with any 
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